Modeling Speech Melody as Communicative Functions with PENTAtrainer2
نویسندگان
چکیده
This paper presents PENTAtrainer2, a semi-automatic software package written as Praat plug-in integrated with Java programs, and its applications for analysis and synthesis of speech melody as communicative functions. Its core concepts are based on the Parallel Encoding and Target Approximation (PENTA) framework, the quantitative Target Approximation (qTA) model, and the simulated annealing optimization. This integration allows it to globally optimize for underlying pitch targets of specified communicative functions. PENTAtrainer2 consists of three computational tools: Annotation tool for defining communicative functions as parallel layers, Learning tool for globally optimizing pitch target parameters, and Synthesis tool for generating speech melody according to the learned pitch targets. Being both theory-based and trainable, PENTAtrainer2 can serve as an effective tool for basic research in speech prosody.
منابع مشابه
PENTATrainer2: A hypothesis-driven prosody modeling tool
Prosody is an essential aspect of speech, as it carries both lexical and non-lexical information. A conventional approach to studying speech prosody is to collect and analyze F0 data based on certain hypotheses and then develop a theory based on the observation, which constitutes the final conclusion of the study. This process is however far from complete, as the developed theory has not been a...
متن کاملSpeech melody as articulatorily implemented communicative functions
The understanding of speech melody, i.e., pitch variations related to both tone and intonation, can be improved by simultaneously taking into consideration two basic facts: that the melody conveys communicative meanings, and that it is produced by human articulators. Communicative meanings, as I will argue, are conveyed through a set of separate functions which are realized by an articulatory s...
متن کاملModelling Japanese intonation using PENTAtrainer2
This paper presents results from Japanese intonation modelling using PENTAtrainer2, an articulatory synthesiser. Our first aim is to show that PENTA, on which PENTAtrainer2 is based, can achieve high accuracy in predictive synthesis of varying intonation contours. We trained the synthesiser on a 6251-sentence functionally annotated corpus and generated F0 contours for each communicative conditi...
متن کاملDiscovering Underlying Tonal Representations by Computational Modeling: a Case Study of Thai
In the present study we test a computational method for investigating underlying tonal representations. The representation explored is in the form of simple linear functions as ideal pitch targets, with which close-to-natural F0 contours can be computationally generated. The estimation of the pitch targets is done with PENTAtrainer2, a hypothesisdriven prosody-modeling tool that combines functi...
متن کاملModeling tone and intonation in Mandarin and English as a process of target approximation.
This paper reports the development of a quantitative target approximation (qTA) model for generating F(0) contours of speech. The qTA model simulates the production of tone and intonation as a process of syllable-synchronized sequential target approximation [Xu, Y. (2005). "Speech melody as articulatorily implemented communicative functions," Speech Commun. 46, 220-251]. It adopts a set of biom...
متن کامل